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ABSTRACT 

The Kepler mission lias discovered a large number of planetary systems. We analyze the implications 
of the discovered single/multi-exoplanet systems from Kepler's data. 

As done in previous works, we test a simple model in which the intrinsic occurrence of planet is an 
independent process, and with equal probability around all planet producing stars. This leads to a 
Poisson distribution for the intrinsic number of planets around each host. However, the possibility of 
zero/low mutual inclination is taken into account, creating a correlation between detecting different 
planets in a given stellar system, leading to a non Poisson distribution for the number of transiting 
planets per system. Comparing the model's predictions with the observations made by Kepler, we 
find that the correlation produced by planarity is insufficient and a higher correlation is needed; either 
the formation of one planet in the system enhances the likelihood of other planets to form, and/or 
that some stars are considerably more fertile than others. Kepler's data presents evidences that both 
correlations might play a part, in particular a significant dependency in the radial distribution of 
planets in multi-planet systems is shown. Followup observations on Kepler planet's hosts can help 
pinpoint the physical nature of this correlation. 



1. INTRODUCTION 



The Kepler satellite (jBorucki et al.|[20lol ) has opened a 
new era in exoplanet research. Since Kepler's launch over 
2300 exoplanet candidates have bee n reported, inost of 
which are Neptune sized or smaller (jBorucki et al.l[201ll : 
iBatalha et al.l[2012l ). In the following, we refer to planet 
candidate s as planets due to the low occurrence of fals e 
positives (jMorton fc Johnsonll2011l : lLissauer et al.ll2012[ ). 
The observational data from Kepler's first four and six- 
teen months of act ivity respectively as pre sented in 
iBorucki et all ()2011f ) and IBatalha et al.l (|2012f ). revealed 
a large number of multi-exoplanet systems. The large 
amount of data gives a significant boost to the statistical 
discussion on multi-exoplanet systems. In particular, one 
may deduce from the data informa tion about the mutual 
influences of planets on ea c h other (iRagozzine fc HolmanI 
l2OT0t iLissauer et al.ll20TlL iTremaine fc Dongll20ll l 

The simplest model, which assumes that the occur- 
rence of transiting planets is an independent process and 
that all stars are equally likely to produce planets, re- 
sults in a Poisson distribution of the number of transit- 
ing planets in a system. However, the observed transiting 
planet distribution from Kepler has been shown not to 
fit a Poisson distribution (.Lissaucr et a l. 2011). This can 
easily be seen by comparing the ratio of the number of 
systems with one detected planet to the number of sys- 
te ms with two detected planets from the data published 
in IBatalha et al.l (|2012D . For a Poisson distribution, this 
ratio is given by ^/2, where /i is the mean of the Pois- 
son distribution. Therefore, we expect to have approxi- 
mately fjL ~ 0.344. Using this mean, it follows that there 
should only be 2.4 systems with four transiting planets. 
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0.16 with five transiting planets and 9 • lO^'^ systems with 
six transiting planets. These numbers are very different 
from the observed data which has 27 systems with four 
transiting planets, 8 systems with five transiting planets 
and one system with six transiting planets (see 121) • It is 
evident that the real distribution has a longer tail since 
Kepler's data indicates that there are more systems with 
a large number of transiting planets than predicted from 
a Poisson distribution. 

The long tail can arise if the planet occurrence is a 
dependent process, where the formation or detection of 
one planet enhances the probability for another planet 
to fo rm, and/or if solar sy stems tend to be rather planar 
(e.g.. ILissauer et ani2011[ ). In this paper we examine the 
predictions of a planar and low mutual inclination plan- 
etary systems under the assumption that planet occur- 
rence is independent. In such planar and low mutual in- 
clination independent models, while the true occupation 
number is Poisson, the transiting planet distribution is 
not, since planarity implies that if a given planet transits, 
others are more likely to transit as well. We compare the 
predictions to Kepler's data. We see that while planarity 
indeed creates a longer tail than in a Poisson distribu- 
tion, it is not yet enough to explain the planet number 
distribution. The structure of the paper is as follows: In 
IJH we present the analytical planar independent model 
and compare it to Kepler's observations in fJD We then 
expand the planar independent model in §4.2! to include 
finite mutual inclinations in the planetary systems. We 
show that neither can explain Kepler's data and some 
correlation is necessary. In ij5] we address the possible 
reason for the model's failure and in fj6]we compare the 
independent model to previous works. Finally in ^ we 
discuss possible origins for these correlations. 



KEPLER'S DATA AND THE INTRINSIC PLANETARY 
DISTRIBUTION 



Foll owing iHoward et al.l ()201lD and iTremaine fc DonsJ 
(|2012f ). we restrict Kepler's d etections to the wel l deter - 
mined F, G and K stars. Like ITremaine fc Pond ()2012D . 
wc only investigate stars that have log g > 4 , 4000°i^ < 
Teff < 6500°A' and Kepler magnitude 9 < Kp < 16, 
w here the data was ta k en from Kepler ' s pub lished result 
in lBrownet all (|20lTI) : iBatalha et all (|20T2| ) 0. 

Since completeness of the sample is required for a 
proper statistical analysis, we must restrict the discus- 
sion to planets with short enough periods. This restric- 
tion insures that the data can be assumed to be complete, 
i.e. further data releases will not change significantly the 
observed data up to the cut-off. In order to evaluate, 
''max ,the upper boundary of our sample, we calculate the 
cumulative number of planets which satisfies the above 
criteria as a function of r - the semi-major axis of the 
planet' s orbit in units of R ^, - the stellar radius. Fol- 
lowing iHoward et al.l (j2011| ) , we count each planet with 
a semi-major axis r as r planets, to account for planets 
in other systems in which the pla nets are not transit- 
ing d ue to unfavorable inclination (jBorucki fc Summer j 
Il984| ). Note that this factor should be applied to each 
transiting planet regardless of the planarity of the system 
and the correlations it creates. In figH] we plot the cal- 
culated cumulative number of planets with radii smaller 
than Rjupiter/'i as well as for planets with radii larger 
than Rjupiter/'i- At a distance of Tmax '~ 75 the smaller 
planets experience a role over, whereas the larger planets 
do not (see figure [ij. Wc interpret this break as the dis- 
tance at which the completeness of the sample, starts to 
break for smaller planets. This interpretation is based on 
the assumption that the distribution of small planets and 
of large planets as a fu nction of their semi major axis is 
similar as was found by lYoudinl (|2011| ) . This break gives 
the upper boundary of our sample which corresponds to 
a period of 75.5 days. 

Since the exact location of the role over is unclear and 
may be dependent on the definition of small and large 
planets, one may argue that the uncertainty in the loca- 
tion of Tmax rnay affect the applicability of our results. 
Therefore, in Sj8l wc discuss the applicability of our result 
in the case where the data is assumed to be complete up 
to distances smaller than 75i?*. We find the conclusions 
to be quantitatively similar for any r^ax > 45i?H.. There- 
fore, from now on and up to [J8l we assume the data is 
complete up to r^ax = 75 and restrict our data analysis 
for distances smaller than that. 

The probability of hosting a planet within the range 
{r, r -|- dr} is given by f{r)dr where /(r) is defined to be 
the occupancy distribution. 

At the range {0,rmax}, we fit /(r)to some piecewisc 
power-low function of the form: 



/w = 



= < 





A 




r <ra 

r e {ra,n} 
r e {rb,rc} 

' -^ 'max 



(1) 



In order to calculate Tq , Vb and Tc , we minimize the re- 
sult of the chi-square between the empirical cumulative 

^ http://archive.stsci.edu/kepler/planct_candidatcs.htirLl 




Fig. 1. — The normalized cumulative number of planets as a 
function of r for small and big planets. The black dashed line 
denotes r = 75, the semi major axis in which the small planets 
experience a role over 




Fig. 2. — The normalized cumulative number of planets as a func- 
tion of a/Rf. The blue solid line is the analytical fit for f f{r)dr 
while the red dots are from Kepler's data. 

distribution function and the theoretical function with 
regards to ra,rb and re- We find: r^ = 3.4, rb = 12.8 
and Tc — 30.8 for fmax = 75. 

Figure 121 shows our fitted cumulative distribution func- 
tion compared to the observed data. It is evident that 
the functional form described by equation [1] is sufficient 
to adequately describe the data. 

Note, that although there is a possibility that the cur- 
rent data may be supplemented by the discovery of ad- 
ditional, non transiting, planets, e.g. through TTV's 
(I Ago! et al I IMol iHolman fc Murravl [2005 1 : iFord et al.l 
l2011ll2012D . this would not affect our analysis since those 
were already taken into account statistically, by count- 
ing each transiting planet as r planets. In this paper, we 
compare the expected distribution of transiting planets 
with Kepler's data. 

3. THE INDEPENDENT PLANAR^ MODEL 
3.1. Model Assumptions 

We propose a very simple model in attempt to explain 
the amount of systems with some number m of transiting 
planets. This simple model allows us to evaluate these 
numbers analytically. 

The basic assumptions of the independent planar 
model are as follows: 

1. All planets in a system are exactly aligned 

- This assumption maximizes the correlation be- 
tween different planets in a system and fortunately. 



it allows us to analytically model the probabilities 
of having a given number of transiting planets in 
a given system. This assumption only serves us in 
the analytic discussion, while later we also discuss 
the case of partially misaligned planetary systems 
(§3.2). 

2. All of the stars and planets are identical - 

Taking only F, G and K stars from the Kepler data, 
we limit ourselves to stars which have compara- 
ble mass and radii. By not including planets that 
are detected above a certain dimensionless semi- 
major axis r, we effectively reduce the effect of the 
planet's size on the detection efficiency. 

3. The Occupancy distribution of a planet exist- 
ing at a given distance from its stellar host, /(r), is 
the same for all the stars which are capable of pro- 
ducing planets and is given by equation ([T]). This 
assumption implies that in a given stellar system 
occurrence of a planet at one position is indepen- 
dent of the occurrence of other planets at other 
distances. 

The basic assumptions of the independent planar model 
are simi lar to some aspects o f models previously sug- 
gested (ILissauer et al.l 120111: iTremaine fc Pond 120121: 
IYoudin|[2011[) . A detailed comparison between this model 
and its results to those of previous works, are discussed 
ongH 

3.2. P{rn) - The probability for m transiting planets 

We find P{m), the probability that a given star hosts m 
transiting planets, for a general model, in which the oc- 
cupation probability function has a general form of f{r). 
In the following we denote distances in units of R^ . 

The zero mutual inclination assumption guaranties 
that for each orientation of the plane, there is a max- 
imal distance from the star beyond which a planet does 
not transit. 

The probability for m transiting planets around a given 
star is therefore: 



P{m) 



' , I Probability density that the maximal 
transit distance is r. 



Probability that m planets 
exist at distances smaller than r. 



(2) 

The probability that m planets exist at distances 
smaller than r can be found from the mean number of 
planets up to distance r 



F{r) 



dr' fir'). 



(3) 



Assuming no mutual influences between planets, the 
number of planets up to some radius is a Poisson vari- 
able. Therefore, the probability that m planets exist at 
distances smaller than r is: 



Probability that m planets 
exist at distances smaller than r 



F{ry 



.-F{r) 



(4) 

Given a random orientation of the plane of the system, 
the probability of an object located at radius r to transit 



is 1/r (jBorucki fc Summers! 1 198^ . Therefore, the prob- 
ability density that the maximal transiting distance is r 
is given by l/r^. Substituting this result and equation 
([4]) into equation ([2|) we obtain: 



P{m) 



dr 



F{rY 



-Fir) 



(5) 



In reality, there is a radius, Tmax, beyond which tran- 
siting planets are less likely to be detected due to the 
finite duration of the observations. Therefore, we lim- 
ited our sample to r < rmax so that f{r> rmax) = 0. 
Consequently, F{r > r,nax) = F{rmax), and equation ([5]) 
becomes: 



P(m 



^0 



dr 



F{ry 



-F(r) 



F(r Y 



-F(r„ 



(6) 



3.3. P{m) for our Occupancy Distribution 

So far, we have found P{m) for a general form of f{r). 
At this point, we find P{m) for our /(r) given by equa- 
tion du. 

P{m), is a function of / via F. Therefore, we can 
decompose it into four different parts 

P(m) = Pa{m) + Pb{m) + P,{m) + P4m) , (7) 

where Pa{m), Pb{m) and Pc{m) correspond to the differ- 
ent regimes of f{r), and Pd{m) is related to the last term 
in equation ([6]). F{r) is simply given by: 



F{r) 



dr fir) 

I 

rG{0,rJ 

A{r^-rl) re{ra,rb} 

Fin) + Hr - Tft) re {rb, rj 

F{rc) + Xrc \n{r/rc) r e {r^ rmax} 



(8) 



where A = A/3r^. This gives us: 



Pa{m) 
Pb{m) 
Pcim) 



dr 
dr 



pm 

r^ml 

pm 



dr 



pr, 



-F 



P^[rn) ^ ^(^"^a^r e-f(fin 



nnaxm! 



(9a) 
(9b) 
(9c) 
(9d) 



Since the number of systems with zero planets is vastly 
larger than these with the planets, we add another free 
parameter, C, which determines the fraction of stars that 
are able to produce planets with efficiency A. Note that 
we still require that stars that are capable of producing 
planets to be identical. In this case: 

P^{m) = 
( C [Pa{m) + Pb(m) + Pc{m) + Pd{m)] for m 7^ 

I 1-E™'=i^''(™') form = 

(10) 



TABLE 1 

The number of systems with m transiting planets as 

OBSERVED BY KePLER, NobsilTT-), AS WELL AS THE NUMBER OF 

systems with A GIVEN AMOUNT OF transiting PLANETS AS 

EXPECTED FROM THE INDEPENDENT PLANAR MODEL, N^^pi'm,), ALL 

UNDER OUR CONSTRAINS AS PRESENTED IN J2] 



m 


Af°''=(m) 


N^^Pim) 





150,000 


150,000 


1 


1,185 


1137.8 


2 


204 


271.9 


3 


69 


62.1 


4 


20 


11.96 


5 


8 


1.94 


6 





0.27 



In a sample of Ns stars, the expected number of systems 
that contain m transiting planets is 



Ne.pim) = NsP^im) 



4. COMPARISON TO OBSERVED DATA 



(11) 



Now that wc have an analytical independent planar 
model for N^xp{m) as a function of A and C, we can 
compare this model to the observed data. 

The number of systems, with a given number of tran- 
siting planets as observed by Kepler which satisfy our 
completeness constrains, as presented in ^ is shown 
in table [1] Note that the number of systems that do 
not contain any planets is hard to estimate due to the 
fact that not all of t he stars were observed continuously 
(jBatalha et al.|[2012D . We set this number by hand to be 
1.5 • 10^, where the exact number may differ from this 
amount and be between (1.28 — 1.9) • 10^. However, the 
exact number is degenerate with our parameter C and 
has no influence on the parameter A, therefore it does 
not affect the applicability of our result. 

4.1. Zero Mutual Inclination 

In order to find A and C which best fit the observations, 
denoted by Aq and Co 0, we define a likelihood function 
Lohs(A, C), which represents the likelihood of having Nobs 
systems with rh transiting planets, given an expectation 

value Nexp{X,C) 



LoU\C)^ f] 



Nobs{m)\ 



-N,^p{m) 



(12) 
This definition arise due to the fact that Nexp{m) <^ N^ 
implying that N(m) is a Poisson variable with an ex- 
pectation value Nexp{m). We find that Lobs{^,C) has a 
maximum when Aq = 1.78-10^^, Co = 0.37, and its value 
is Lobs(Ao,Co) = 1.88 • lO^^^ Figure [3] shows the best 
fit model compared to the observations of Kepler along 
with the best fit Poisson distribution. As expected, the 
planar model fits Kepler's data better than the Poisson 
distribution since planarity adds an additional correla- 
tion. The data agrees with the fit for m = 1 transiting 
planets, overestimate the number of pairs and underes- 
timate the number of systems with to > 3 transiting 



■^ The subscript "0' 
planets in the system 



is for zero mutual inclination between the 
Later on, when we discuss non-zero mutual 

inclination between the planets, the subscript "0" will be replaced 

by/. 




Number of planets 



Fig. 3. — The number of systems with m planets. Kepler's data 
is denoted by the blue dots while our analytical theory is plotted 
with the red dashed line. The black dashed line shows the best fit 
Poisson distribution for comparison. The error bars are the 95% 
confidence level assuming each bin has a Poisson distribution. Note 
that the number of double systems is significantly underestimated, 
by more than 4(t. 



planets (see also iJohansen et al.l (I2012D for similar re- 
sults). This can be understood as a partial failure of the 
independent planar model in its attempt to overcome the 
long tail problem that ruled out the Poisson distribution 
model presented before. In other words, the independent 
planar model does create an inherently longer tail com- 
pared to the Poisson distribution, but the tail is not long 
enough. 

In order to estimate whether or not this deviation of 
the data from the predictions of the model is significant, 
we compare Lobs(Ao, Co) to the typical likelihood; i.e the 
expectancy value of the likelihood of sets of large num- 
ber of N(jn) drawn from a Poisson distribution with ex- 
pected value Nexp{m). 

The typical likelihood can be calculated as follows. 
The number of systems with m transiting planets is a 
Poisson variable with a mean Nexp{m). The probabil- 
ity of detecting TV systems with to transiting planets, is 
[Nexp{m)^ /Nl]e~^'--p^'^\ therefore the contribution to 
the typical likelihood that comes from systems with m 
transiting planets is 

L'"(Ao,Co) = 

m 




probability that 

A^ systems 

contain m planets 



(13) 



where Ng is the total number of stars in the sample. 
Since the probability that TV systems contain m planets 
is, again, [Nexpim)^ /N\]e~-^""'''"^\ we obtain: 



N, 



L7lpi^o,Co) — y ^ 



f [N,xp{m)r ^^N.^^M 



N=0 






TV! 



(14) 



Since the typical likelihood is given by the product of 
L5^p(Ao,Co) for aU possible to's, we obtain: 



LtypiXo, Co) — J_ J_ 



Y f [Nexp{rn)]^^-N^^,{,ni ' 



.N=0 






TV! 



(15) 

Wc find LtypiXo,Co) = 5.18 ■ 10"^. We define the ra- 
tio Lobs(Ao,Co)/-^typ(Ao,Co) to be the success rate of 




Fig. 4. logio{^o6s) as a function of A and C for zero mutual 

inclination. The red dashed line shows A oc C~^ . 

the model. This parameter is a proxy to the vahdity of 
the model, values around unity indicate that the model 
fits the data well, while small values indicate improbable 
models. Since in our case the success rate is 3.64-10"^, we 
find that the independent planar model which assumes a 
planar system with independent planet occurrence, can 
be disqualified with high certainty. 

An interesting property of the likelihood function is 
that it depends strongly on the product A • C. This prod- 
uct is proportional to both the total number of planets 
per star and to the number of transiting planets per star. 
Specifically, for our function /(r) given by equation ([T]), 
46.5CA is the mean number of planets and 1.9CA is the 
mean number of transiting planets, regardless of the mu- 
tual inclination. It is only the distribution of the num- 
ber of transiting planets per system that depends on the 
mutual inclinations. Along the curve A oc 1/C, the like- 
lihood docs not change significantly. Figure |4] shows the 
value of the likelihood function in the A — C plane. 

4.2. Finite Mutual Inclination 

In nature planetary systems are more likely to have 
a small mutual inclination between planets due to var- 
ious processeflj. There is not much observational infor- 
mation about the m utual inclinations be t ween planets. 
iHolman etHI ()2010[ ) and iLissauer etlUI (|2011| ) placed 
an upper limit of / < 10° on the mutual inclination of 
the sy stems Kcplcr-9 and Kepler- 11, and iBatalha et ahl 
(|2011l) found / = 5° - 20° for the Kepler 10 system. 
However, no statistical analysis has been able to rule out 
the possibility that the typical mutual inclination may 
be larger. 

We now extend our treatment to include systems 
with mutual inclinations between planets. An analytical 
derivation for the general case is much more complicated 
than the special case with zero mutual inclination; the 
results in this section are purely numerical. 

We expect that by accounting for finite mutual inclina- 
tion the model of independent planets would deviate even 
more from observations, since it will reduce the depen- 
dency in the detection process which will lead to fewer 
systems with a large number of transiting planets. 

The mutual inclination is incorporated by having each 
planet randomly change its inclination relative to the sys- 
tem's plane. The random inclination is drawn from a 

^ Processes that excite the eccentri city usually increase 
the mutual inclination between planets IJGoldreich et al.l 120041 : 
IChatteriee et al.ll2008l : IJuric fc TremaindUOOa e.g..). 



Rayleigh distribution with a mode /. We inspect sys- 
tems with / G {2°, 5°, 10°, 20°}, this adds an additional 
free parameter to the independent planar model. 

In order to evaluate P'^ (m), the probability for m tran- 
siting planets around a given star, in the case of non 
zero mutual inclination we use a numerical simulation. 
In each run of the simulation, we produce a planetary 
system that generates planets with a probability C. If 
the system is capable of producing planets, we construct 
^ 4300 bins ranging from Tq to r^ax in such a way that 
the probability for each bin to host a planet is the same 
and in accordance with /(r) as presented in equation 
([1]). Each bin is then assigned a random number that 
determines if it hosts a planet. 

We then randomly draw an inclination for the whole 
planetary system from a uniform distribution of — 1 < 
cos(is) < 1 where is is the system's inclination relative 
to the observer. For each planet we randomly draw an 
orbital plane whose inclination relative to the plane of 
the planetary system, /, is drawn from a Rayleigh distri- 
bution and its argument of the ascending node is drawn 
from a uniform distribution. The inclination between 
the planet's orbital plane and the observer, ip, is then 
computed. 

Counting the number of planets satisfying |rcosip| < 
1, we calculate how many planets transit in that specific 
system. This process is then reproduced for the number 
of stars in the sample Ng c^ 1.5-10^, and by averaging this 
simulation on lO'^ runs we find numerically N^ (m), the 
expected number of systems with m transiting planets, 
under the assumption of a typical mutual inclination I. 

After finding N^ (m) we find the maximum likelihood 
of the observations using equation ()12p where Npxp{m) 
is replaced by N^ (m) and denote the A and C that 
correspond to this maximum to be A/ and Cj. 

As was done in the previous section, we compare 
^ibsi^i^^i) ^° ^^^ typical likelihood expected for /, Xj 
and Ci (using equation (jisp where Nexpirn) is replaced 
by NI (m)). As done in §4.1) we define the ratio 

^ibsi^i^ ^i)/^iypi^ij^i) to be the model's success rate. 
The opposite extreme to the independent planar model 
is the case of an isotropic distribution of planets. It can 
be described by a simple analytical expression. If planets 
have an isotropic distribution, then each cell has some 
small chance (but independent from the other cells) of 
hosting a transiting planet. This naturally gives rise to a 
Poisson distribution. However, a Poisson distribution by 
itself hardly fits the observational data (jLissauer et al.l 
l2011f ). The analog of an isotropic distribution to the 
discussed model is having a fraction, C, of stars being 
capable of hosting planets. Those that are capable of 
hosting planets have a Poisson distribution of the number 
of transiting planets with a mean fj,. The probability of 
a system hosting m transiting planets is then 



pCl^m) = Ci 



— for 771 7^ 

P^(0) = 1 - ^ P^im') (16) 

m'=io for in ~ 0. 

= (1-C)+Ce-'' 



Inclination 


A/ 


Ci 


Success Rate 


0° 


1.78 • 10-2 


3.7-10-1 


3.64 lO-* 


2° 


(2.3 ±0.1) -10-2 


(2.84 ±0.05) -10-1 


4.1 10-10 


5° 


(3.7 ±0.11) -10-2 


(1.78 ±0.05) -10-1 


1.51 ■ 10-11 


10° 


(6.5 ±0.22) -10-2 


(9.8 ±0.3) -10-2 


3.94 10-12 


20° 


(1.24 ±0.04) -10-1 


(5.3 ±0.16) -10-2 


3.39 ■ 10-12 


Isotropic 


2.78 ■ 10-1 


2.36- 10-2 


2.93- 10-1* 



TABLE 2 

Summary of our best fit results for different mutual 

inclinations. a/ is proportional to the probability of 

having a planet in a given cell, cj is the fraction of stars 

that are capable of hosting planets and the success rate 

IS THE RATIO L^^^^{\j , Cj)/ Lf {\j , Cj) WHICH REPRESENT THE 

PROBABILITY THAT THE INDEPENDENT MODEL FITS THE 

OBSERVATIONS. 

The Poisson parameter fi is related to Xisotropic by: 

fir) 



Ai = 



dr- 



(17) 



Table [5] contains A/, C/ and the success rate for the 
cases of / e {2°, 5°, 10°, 20°} as was found from the 
numerical simulations. In addition, the table also con- 
tains the parameters for the planar case, as was found in 
§4.11 as well as for the isotropic case. 

4.3. Conclusions from the comparison to the 
observations 

As one can see from table [2l all possible inclinations 
are very unlikely (success rate<^ 1). Therefore, our main 
conclusion is that Kepler's observations can not be ex- 
plained by a model which assumes that planet occurrence 
is an independent process and all planet producing stars 
are identical. Some additional correlation is required. 

In the next section, we discuss briefly the types of cor- 
relations that might explain Kepler's data. 

5. POSSIBLE EXPLANATIONS 

The inapplicability of the independent models pre- 
sented before, might arise due to differences in the star's 
planet producing efhciency, and/or if planets in a given 
solar system affect each other, either in terms of creation 
or dynamical evolution. In this section, we present two 
possible explanations that may cause the failure of the 
independent model, one which arises from dependency 
between planets in a given solar system and one which 
arises from differences in the star's planet producing ef- 
ficiency. 

5.1. Mutual Planet Dependence 

Previous works have shown that planets in a given 
stellar sy stem tend to be re latively close t o res - 
onances (iLissauer et al.l 120111 : iFabrvckv et al.l l2012f ). 
iLatham et al.l ()2011[ ) have shown that systems that con- 
tain giant planets (in particular hot Jupiters) tend to be 
single systems rather than multi. This indicates that mu- 
tual planet dependence might play a role, i.e. planets in 
a given stellar system affect the semi major axes of one 
another. In this section, we investigate this correlation. 

Assuming that planets occurrence is an independent 
process and taking into account only pairs of transiting 
planets, we can calculate the distribution of the ratio 
r-ijrx^ where r-i and r\ are the semi major axes of the 
distant and the close planets respectively. This distribu- 
tion, which depends on /(r), can be found numerically 




Fig. 5. — The distribution of the ratio r^lrx. The red line 
represents the probability density of Kepler's 204 pairs and the 
blue line represents the probability density for our synthetic pairs. 

and compared to the distribution of r2/ri from the pairs 
in Kepler's data. 

In figure [SJ we present the distribution of the ratio 
r^jrx from Kepler's 204 pairs (red line) compared to the 
distribution of the ratio r2/ri from our synthetic pairs 
(blue line). 

The distribution from Kepler's data has a deficit com- 
pared to the synthetic pairs both in the similar radius 
pairs as well as the distant pairs. Using a two sample KS 
test, we find that the two distributions can not arise from 
the same distribution (the null hypothesis is rejected at 
level ~ 10~^°). This conclusion rejects any model which 
assumes that the location of planets in a given system 
is independent on the existence of other planets in the 
system. 

The nature of the dependency in a given system can 
be describe qualitatively. We know that planets can not 
be too close to e ach other due to co nsideration of dy- 
namical stability (jLissauer et al.|[2"01lL e.g.), this explains 
the deficit in pairs with the same radius. On the other 
hand, it is evident that planets in a given system tend 
to be closer to one another than expected from an inde- 
pendent distribution, this is also demonstrated by their 
tendency to be near low-order mean-moti on resonances 
(jLissauer et al.|[20Tll : IFabrvckv et al.|[20T^ , this explains 
the deficit of planets with a large radius ratio. 

Note that a dependency in the location of different 
planets in a given solar system naturally affects the prob- 
ability of planet detection. The expected number of sys- 
tems with some specific number of planets m, is influ- 
enced by this dependency. Therefore, we find this de- 
pendency to be at least a partial cause to the observed 
deviations from the independent models expectations. 

5.2. Fertile and Sterile Systems 

Some previous works have argued th at metal rich stars 
are more likely to produce pl anets (iFischer fc Valentil 



2005t I Johnson et all 2 010: Schl aufman fc Laughlinll2011t 
Johnson fc Lill2012l: IBuchhave eral.ll2012l ). It is not un- 



reasonable to assume that the normalization or even the 
functional form of /(r) may depend on the metallicity of 
the star. 

In order to see whether or not a metallicity dependence 
of A is sufficient to produce the observed correlation, we 
divide the planets producing stars into two different pop- 



A 



C 



Planar 4.3 ■ IQ-^ 
Isotropic 5.2-10-2 



s/L 



6.7 
8.1 



7.2 ■ 
7.4- 



iyp 



20% 
10% 



0.20 
0.11 



TABLE 3 

Summary of our best fit results allowing two different 

planet producing efficiencies. q is the ratio of planet 

PRODUCING EFFICIENCY AND Fq IS THE FRACTIONAL PORTION OF 
THE HIGH EFFICIENCY POPULATION. 

ulations characterized by two different values of A. Note. 
that this model is not an exact model, but rather an or- 
der of magnitude calculation done in order to see weather 
metallicity's dependence as the one presented by previ- 
ous works, may explain Kepler's results. 

The planet producing efficiency seems to increase by a 
significant factor at metallicity \Fe/H] = .1 compared 
io \Fe/H] = -0.18 ((Fischer fcValentil 12001. According 



to iPilachowski et al 



(|2012[ ). the metallicitv distribution 
among Kepler's stars is [Fe/H] = -0.18 ± 0.28. Guided 
by these results, we divide the planets producing stars 
into two categories: one, which contain 80% — 90% of 
the planets producing stars and characterized by planet 
producing efficiency A, and the other, contain a fraction 
of Fq ^ 10% — 20% of the planet producing stars, which 
characterized by planet producing efficiency Q\, where 
Q is in the range 3 — 10. Maximizing the likelihood func- 
tion (equation [T2|). we find a good fit to the observations 
{Lobs ~ 10~^ivtyp) both for an independent planar model 
as well as for isotropic model (sec table |3|). 

It is therefore conceivable that differences in the metal- 
licities between different stars may be responsible for the 
correlations we observed in the data. 

Note that the correlation between metallicity 
and planet producing efficiency is not unambigu- 
ous and resent works h ave s hown d ifferent results. 
ISchlaufman fc LaughlinI (|2011[ ) and iBuchhave et al.l 
(|2012f ). show that this correlation only holds for Nep- 
tune sized planets and above, or for M dwarf stars. 
Since Kepler's data contains mostly smaller planets 
where the metallicity has a weaker influence on the 
star's planet producing efficiency, another mechanism 
with such significant influence on the A distribution, 
may explain the deviations from the independent model. 

6. COMPARISON WITH PREVIOUS WORKS 

iLissauer et al.l ()2011l ) used the data from Kepler's first 
data release to estimate statistically the coplanarity of 
planetary systems. Their sample included all planets or- 
biting stars with i?, < lOi?©, with periods 3 < P < 125 
days, radii 1.5 < Rp < & Earth radii and S/N > 16 
(as listed for Q0-5[j ). They numerically create a syn- 
thetic population of planets by assigning each star in the 
Kepler's target list a number of planets drawn from ei- 
ther a Poisson, uniform or exponential distribution. The 
planets are then given periods and sizes to match the ob- 
served population while maintaining a miniinum radial 
separation between planets in order to preserve the sys- 
tem's stability. Each planetary system is then randomly 
oriented and each planet is given a random inclination. 
The simulated planets are then checked to see if they 
would have been detected by Kepler. They find that 
a low-mutual inclination gives the best result and that 

® QO-5 symbolizes data taken from the first six quarters of Ke- 
pler. 



only 3% — 6% of stars host planetary systems. How- 
ever, they can not rule out the possibility that Kepler's 
data contains stars with a large amount of planets and 
a high mutual inclination. This work is most similar to 
our approach, especially the case where they choose the 
number of planets is chosen from a Poisson distribution, 
which is suitable for independent planets. The reason 
that we arrive at d ifferent conclus i ons it that we use a 
larger data set than ILissauer et aTl ()20lH ). We have also 
implement an analytical model ra ther than a numerical 
one which simplified the analysis. iLissaueret al. (20111) 
and us differ in our approach towards addressing the is- 
sue of data completeness. We empirically find the dis- 
tance a^_wiiich_small planets are no longer detected and 
while ILissauer et al.l ()201lD mimic the stellar and planet 
population of Kepler and simulate the transits of a given 
planet. 

Using the same stella r selection criteria as we do, 
iTremaine fc Don3 ()2012D analyzed planets from Kepler's 
first data release with radii smaller than 2 Jupiter ra- 
dius and periods up to 200 days. Taking into account 
geometric corrections, they fit a "planetary distribution 
function" , a distribution function that specifies the prob- 
ability of a star to host a given number of planets, to the 
observed data. This approach is in some sense, the oppo- 
site extreme to ours. They allow for any planet number 
distribution, while our approach which assumes identical 
stars and independent planet results in a Poisson distri- 
bution of the number of planets. It is for this reason that 
they find that the data can be explained by a small num- 
ber of stars hosting a large amount of high mutual incli- 
nation planets as well as a large amount of stars hosting 
a smaller amount of planets in a rather planar configura- 
tion. Like this work, they assume independence for the 
position of planets in a given stellar systems; i.e. the oc- 
currence of one planet did not influence the semi major 
axes of the others. 

However, in ij5.1l we have shown that there is a depen- 
dency between planets in a given planetary system. This 
finding seems to rule out the independent pos ition as- 
sumption used both bv ITremaine fc Dongl (j2012| )'s model 
a s well as our si mple independent planet model. 

lYoudinl (|201lD has tried to find the form of the "un- 
derlying planetary distribution function" . This function 
is somewhat similar to our /(r), but it also contain a 
dependency on the radii of the different planets; i.e. it 
describes the probability of hosting a planet with a given 
radiu s, Rp, a t some period P. In order to find this func- 
tion, lYoudinl ()2011f ) only needed information regarding 
the total number of planets. Therefore, he has used the 
total number of planets under a detailed analysis of sur- 
vey selection effects. Our work tries to answer a different 
question. We assume the simplest underlying planetary 
distribution function that matches the observed period 
distribution and check to see if it is consistent with the 
observed multiplicities in the Kepler's data. This allows 
us to rule out the possibility that planet occurrence is 
independent of other planets. 

7. SUMMARY AND CONCLUSIONS 

It is clear that transiting planet occurrence cannot be a 
totally independent process; transiting planets do not oc- 
cur randomly in systems, since independent process gives 
rise to a Poisson distribution in the number of systems 



with m transiting planets, rather additional dependen- 
cies are required. 

The simplest dependency one may assume is a perfectly 
planar solar system. In this paper, we found analytically 
the probability that a given star hosts m transiting plan- 
ets, P{m), assuming that planet occurrence is an inde- 
pendent process and allowing for planar systems. The 
probability, P(m), in this model, depends only on the 
parameter A, which represents the planet producing efB- 
ciency of all stellar systems. The planar assumption was 
not adequate in explaining Kepler's observations, so we 
added another free parameter C, which determines the 
fraction of stars that are able to produce planets. This 
model also failed to explained the observed data. 

We conclude therefore that Kepler's observations can 
not be explained by any model which assumes that planet 
occurrence is an independent process with the same effi- 
ciency A for all planet producing stars. Some additional 
correlation is required. We discuss two possible correla- 
tions that might explain the deviations from the simple 
independent model. 

The first possible correlation is some mutual influence 
of planets in a given stellar system. In attempt to find 
evidence for such mutual influence, we compared the ra- 
tio between semi-major axes in Kepler's pairs, to that of 
a synthetic population which were numerically produced 
under the assumption of no mutual influence. We find 
that Kepler's data does not match the synthetic popu- 
lation. Although this might not be the only deviation 
from the independent model, we speculate that this mu- 
tual influence is significant and it has an influence on the 
observed number of multiple planets. 

Another possible correlation is that A is not constant 
among the different stars; i.e. planet producing stars 
are not identical and may produce planets with differ- 
ent efficiencies. Using previous studies about the metal- 
licity influence on planet producing efficiency, we have 
shown that taking into account a metallicity distribu- 
tion might explain the deviation of the data from the 
presented model. However, since the influence of metal- 
licity seems to hold only for giant planets, this additional 
dependency should be taken as an example of how an ad- 
ditional dependency can solve the problem rather than 
be interpreted as the real physical dependency. 
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grants and a Packard and Guggenheim Fellowships. E.S. 
is partially supported by an Ilan Ramon grant from the 
Israeli Ministry of Science. 

8. APPENDIX: DISCUSSION ABOUT COMPLETENESS 

In section iJ^J we have discussed the importance of data 
completeness for statistical analysis as the one presented 
in this work. In our work, flnding Tmax, the maximal semi 
major axis for which the data is assumed to be complete, 
was based on locating the point in which the number of 
small planets experience a role over. Since the deflnitions 
of " small planets" and " role over" may be a bit flexible, 
we investigate the validity of our results for r^ax which 
are different from 75i?*. 

The validity of the model is given by the successes rate, 
i.e. the maximal likelihood of the observations when A 
and C are free parameters, compared to the typical like- 
lihood. Therefore, in order to evaluate the influence of 
10° , 
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Fig. 6. — The success rate, i.e the UkeUhood of the best fit 
independent planar model compared to the typical likelihood as a 
function of rmax, the maximal semi major axis for which the data 
is assumed to be complete. 

the assumed rmax on the model, we find the success rate 
of the model for every r„iax £ {15i?,, 75i?*} using the 
same procedure presented in §4.11 The success rate of 
the independent planar model is presented on figure \&\. 
From flgure \6\ one can deduce that for rmax > 25i?* , 
the success rate is smaller than 10^^ and for r,nax > 45i?* 
it is smaller than 10^^. Therefore, changing r^ax from 
the our canonical value of 75i?* by a factor of order unity, 
seems to have a small influence on our qualitative results. 
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